Abstract
Blind Source Separation (BSS) arises in a variety of fields in speech processing such as speech enhancement, speakers diarization and identification. Generally, methods for BSS consider several observations of the same recording. Single microphone analysis is the worst underdetermined case, but, it is also the more realistic one. In this article, the autoregressive structure (short term prediction) and the periodic signature (long term prediction) of voiced speech signal are modeled and a linear state space model with unknown parameters is derived. The Expectation Maximization (EM) algorithm is used to estimate these unknown parameters and therefore help source separation.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Cichocki, A., Thawonmas, R.: On-line algorithm for blind signal extraction of arbitrarily distributed, but temporally correlated sources using second order statistics Neural Process. Neural Process. Lett. 12(1), 91–98 (2000)
Barros, A.K., Cichocki, A.: Extraction of specific signals with temporal structure. Neural Comput. 13(9), 1995–2003 (2001)
Tordini, F., Piazza, F.: A semi-blind approach to the separation of real world speech mixtures. In: Proceedings of the 2002 International Joint Conference on Neural Networks. IJCNN 2002, vol. 2, pp. 1293–1298 (2002)
Smith, D., Lukasiak, J., Burnett, I.: Blind speech separation using a joint model of speech production. IEEE Signal Processing Letters 12(11), 784–787 (2005)
Chu, W.C.: Speech coding algorithms-foundation and evolution of standardized coders. John Wiley and Sons, NewYork (2003)
Feder, M., Weinstein, E.: Parameter estimation of superimposed signals using the EM algorithm. IEEE Trans. Acoust., Speech, Signal Processing 36, 477–489 (1988)
Gannot, S., Burshtein, D., Weinstein, E.: Iterative-batch and sequential algorithms for single microphone speech enhancement. In: ICASSP 1998, pp. 1215–1218. IEEE, Los Alamitos (1998)
Dempster, A.P., Laird, N.M., Rubin, D.B.: Maximum Likelihood from Incomplete Data via the EM Algorithm. Journal of the Royal Society B 39, 1–38 (1977)
Gao, W., Tsai, S., Lehnert, J.: Diversity combining for ds/ss systems with time-varying, correlated fading branches. IEEE Transactions on Communications 51(2), 284–295 (2003)
Couvreur, C., Bresler, Y.: Decomposition of a mixture of Gaussian AR processes, Acoustics, Speech, and Signal Processing. In: 1995 International Conference on ICASSP 1995, vol. 3, pp. 1605–1608 (1995)
Christensen, M., Jakobsson, A., Juang, B.H.: Multi-pitch estimation, Morgan & Claypool (2009)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2010 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Bensaid, S., Schutz, A., Slock, D.T.M. (2010). Single Microphone Blind Audio Source Separation Using EM-Kalman Filter and Short+Long Term AR Modeling. In: Vigneron, V., Zarzoso, V., Moreau, E., Gribonval, R., Vincent, E. (eds) Latent Variable Analysis and Signal Separation. LVA/ICA 2010. Lecture Notes in Computer Science, vol 6365. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-15995-4_14
Download citation
DOI: https://doi.org/10.1007/978-3-642-15995-4_14
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-15994-7
Online ISBN: 978-3-642-15995-4
eBook Packages: Computer ScienceComputer Science (R0)